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FOREWORD

This Indian Standard (first Revision) which is identical with ISO/IEC 4873 : 1991 `Information technology - IS0 8-bit code for information interchange - Structure and rules for implementation' issued by the International Organization for Standardization (ISO) and International Electrotechnical Commission (IEC) was adopted by the Bureau of Indian Standards on the recommendation of Computer Hardware Sectional Committee and approval of the Electronics and Telecommunication Division Council. This standard was originally published in 1982 and was largely based on IS0 4873 : 1979. Subsequent upon the revision of IS0 4873 : 1979, this standard is now being revised to take into account the latest technological developments in this field. The text of the ISO/IEC standard has been approved as suitable for publication as Indian Standard without deviations. Certain conventions are, however, not identical to those used in Indian Standards. Attention is particularly drawn to the following: Wherever the words `International as `Indian Standard'. CROSS REFERENCES In this Indian Standard, the following places the following: In terna tional Standard ISO/IEC 646 : 1991 Information technology - IS0 7-bit coded character set for information interchange ISO/IEC 2022 : 1986 Information processing IS0 7-bit and 8-bit coded character sets - Code extension techniques International Standards are referred to. Read in their respective Degree of Equivalence Identical Standard' appear referring to this standard, they should be read

Corresponding Indian Standard IS 10315 : 1997 7-bit coded character set for information interchange (first revision) IS 12326 : 1987 7-bit and 8-bit coded character sets - Code extension techniques

do

The concerned technical committee has reviewed the provisions of following standards referred in this adopted standard and has decided that these are acceptable for use in conjunction with this standard: ISO/IEC 6429 : 1992 ISO/IEC 10367 : 1991 Information technology character sets Information technology use in 8-bit codes IS0 International sequences Control functions for 7-bit and 8-bit coded sets for

Standardized

coded graphic character

IS0 2375 : 1985

register of coded character

sets to be used with escape
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lndian Standard

8-BIT

CODED

CHARACTER

INFORMATION

SET FOR INTERCHANGE

( First Revision )
1 scope
This International Standard specifies 646. with the code extension techniques specified in an &bit code derived from, and compatible with, the 7-bit coded character set specified in ISO/IEC

The characteristics IS0 2022.

of this code are also in conformance

This International

Standard

specifies an 8-bit code with a number

of options.

It also provides guidance

on

how to exercise the options to define specific versions. This code is primarily intended for general information interchange within an 8-bit environment among

data processing systems and associated equipment, graphic characters and control functions

and within

data communication

systems. The need for

in data processing has also been taken into account. and

The code includes the 10 digits as well as the 52 small and capital letters of the basic Latin alphabet may include accented letters, special Latin letters and/or the letters of one or several non-Latin

alphabet(s).

2 2.1

Conformance
Conformance Conformance

and implementation

2.1.1 A

of information

interchange (CC-data-element) within Standard coded information for interchange is in

coded-character-data-element with that CC-data-element

conformance within

a version of this International conform

if all the coded representations

of characters

to the requirements the version adopted.

of clause 9.

A claim of conformance
2.1.2

shall identify

Conformance

of devices
with this International and 2.1.2.3. in 2.1.2.1, Standard if it conforms to the requirements of 2.1.2. I, which

A device is in conformance and either contains the description 2.1.2.1

or both of 2.1.2.2

A claim of conformance

shall identify

the document

specified

and shall identify the version adopted.

Device description
to this International Standard in 2.!.2.2 shall be the subject of a description and 2.1.2.3. that identifies

A device that conforms made available 2.1.2.2

the means by which the user may supply characters to the device, or may recognize to him, as specified respectively devices from

them when they are

Originating

An originating

device shall allow its user to sui:c)i) rtr~y sequence of characters

the version adopted,

and shall be capable of transmitting

their coded icill ~~r;lr.~)rur~s within a CC-data-element.
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2.1.2.3

Receiving
device

de\ ices
shall he capable of and receiving that and interpreting to 2.1.1, can any coded representations the corresponding among those of of characters characters the version

A receiving
that available adopted, 2.2 The to

are within its

a CC-data-element, user in such

conform user

and shall .make them from

a way them

that

the

identify

and can distinguish

from

each other.

Implement:*l~i~n use of thl<, ~.odt~ requires punched tapes, definitions cards, of its implementation magnetic either and optical in various media media. and For example, these could thus in a

include permitting physical

punched of data

transmission

channels, recording and

interchange medium,

to take

place

indirectly units (such

by means as input

of an intermediate and output devices

or by local connection transmission of this code

of various

computers)

or by means The error

of data

equipment. in physical media and for transmission, taking into account the need for

implementation checking,

is the subject

of other

International

Standards.

3 Normative
The this following International to

references
standards contain At parties provisions the time to which, through reference in this indicated text, were constitute valid. All are provisions standards encouraged Members of are to of Standard. and of publication, based recent the editions on this

subject

revision, the

agreements the most valid

International

Standard listed

investigate IEC and

possibility maintain 199

of applying

editions

of the standards Standards.

below.

IS0 646:

registers

of currently

International

ISO/IEC IS0 2022:

1 Infomtation

rechnology processing

- IS0 7-bil coded characler
IS0 7-bir and 8-bir

sei for information character seu

interchange.

1986

In,tortnation icchniyucs.

coded

- Code

exrension

I.,O/IEC

6429:`) 10367:

InformaGon

rechnology

Control fttnclions for 7-bit and g-bit coded Sumdurdized coded graphic character

characrer seu for

sets. use in g-bit

ISOAEC

1991 Informurion codes.

technology

IS0

lnrernarional

Register of Coded

Characrcr

Scrs to be Used wirh Escape

Sequences

(IS0

2375)

4 Definitions l;or the
4.1
graphic purpose of this International Standar-d position the following which definitions the apply. graphic symbol representing the next

active

position

: The
01 relative

character to which

is to image function

character
h'O7`1~

the next control
ib indicated

is to be executed.

In general.

the active position

in a display by a curx)r.

4.2 4.3 4.4 4.5 4.6

bit combination byte

: An ordered
that

set of bits used for the representation upon as a unit. used for thr that oi-ganization,

of characters.

: A bit string

is operated

character character

: A member
position

of a set of elements of a display

control

or representation a graphic information with one

of data. symbol. that or I\

: The portion

i$ imaging + An

or is capable of

of imaging

coded-character-data-element to consist standards of a sequence for coded

(CC-data-element) of coded bets

element

interchanged in accordance

:;!,c*cified identified

representations

of characters,

more:

character-

--i) To ix: published
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NOTES I. In a communication CCdata-element (PPDU) 2. When defined will environment form all or according part of the IO the Reference information Model for Open Systems Interconnection to the (IS0 740X). a

that

corresponds

Presentation-Protocol-Data-Unit

in that International interchange

Standard. is accomplished by means of inlerchangeable media, a CC-data-element formatting will form all

information

or part of the informalion

that corresponds

IO the user data, and not that recorded

during

and initialization.

4.7

coded character

set; code : A set of unambiguous

rules that establishes a character

set and the one-to-

one relationship 4.8

between the characters of the set and their bit combinations.

code extension

: The techniques for the encoding of characters that are not included

in the character

set of a given code. 4.9 code table : A table showing the character allocated to each bit combination character in a code. consists of a single bit

4.10 control combination. 4.11 control

: A control

function

the coded

representation

of which

function

: An action that affects the recording,

processing, transmission,

or interpretation

of

data, and that has a coded representation 4.12 device : A component information
NOTE gateway

consisting of one or more bit combinations. processing equipment which can transmit, and/or receive, coded

of information

within

CC-data-elements.
be an input/output device in the conventional sense, or a process such as an application program or

- It may function.

4.13 escape sequence procedures. 4.14 Final

: A string of bit combinations

that is used for control

purposes ESCAPE.

in code extension

The first of these bit combinations Byte : The bit combination character

represents the control function

that terminates other than

an escape sequence or a control sequence. a control function, that has a visual representation

4.15 graphic normally

: A character,
printed

handwritten,

or displayed,

and that has a coded representation

consisting of one or more

bit combinations. 4.16 graphic 4.17 repertoire combinations symbol

: A visual representation
set of set.

of a graphic character or of a control function that are represented by means of one or more bit

: A specified

characters

of a coded character

4.18 user : A person or other entity that invokes the services provided by a device.
N( )`l`t?S I. `l`hib entlly funccwn, may he a process such a\ an application program II the "device" is a code convertor or a gatew.3)

for example. as supplwd by the uwr or made available provided IO him, may be In the form of code3 local to the device. or

I. The characters. of non-conventional

visible reprewnlatlons.

that 2 1.2 above is satisfied.

5 Notation, 5.1
Notation

code table and names

The bits of the hit combinations bx is the highest-order, The bit colnbinatioll: attl-ihuting

of Ihe X-bit code 31-e identified

by bs, b7, bh, bs, b4, b3, bz and bl, where or least-significant, bit. notation I)y

or most-signiflc;lnt

bit, and bl is the lowest-order,

may be inter p~ctcd 10 repI-esent integers in the range 0 to 255 in binary weights to the individual bits:

the following
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Bit Weight

b8 128

b7 64

b6 32

ki 16

b4 8

b3 4

b2 2

bl 1

In this international and

Standard,

the bit combinations

are identified

by notations of the form xx/yy, where xx between the notations of the form xx/yy

yy are numbers in the range 00 to 15. The correspondence
xx is the number and I respectively; represented

and the bit combinations

consisting of the bits bg to bt, is as follows: by bs, b7, bh and b5 where these bits are given the weights 8, 4, 2

yy is the number represented by ba, bj, b2 and bt where these bits are given the weights 8, 4, 2
and 1 respectively. of the form xxIyy are the same as the ones used to identify code table positions, and yy is the row number (see 5.2). where xx is

The notations the column 5.2

number

Code table code table consists of 256 positions arranged 00 to 15. by notations of the form xx/yy, where xx is the column number and in 16 columns and 16 rows. The columns and rows

An 8bit

are numbered

The code table positions are identified yy is the row number.

The positions
The notation combination. 5.3 This Names

of the code table are in one-to-one of a code table position,

correspondence

with the bit combinations

of the code. bit

of the form

xxJyy. is the same as that of the corresponding

International characters only

Standard

assigns one name to each character. SPA(`E' and DELETE,

In addition,

it specifies an acronym symbol

for

control For

and for the characters only capital letters, capital

and a graphic

for each graphic and this

character. convention

By convention, be retained

letters, and hyphen are used for writing are used. It is intended of the text.

the names of the characters. that the acronyms

acronyms

space and digit

in all translations

The names chosen to denote graphic characters are intended to reflect their customary this International Neither Standard does not define and does not restrict the meanings does it specify a particular

meaning.

However, characters.

of graphic

style or font design for the graphic characters when imaged.

6 Structure
6.1 Elements

of the 8-bit code
of the S-bit code parts (see figure 1).

The 8-bit code consists of the following a) A CO set

A set of up to 30 control

characters

represented

by bit combinations

OO/OO to 01/15,

except 00/14 and OO/lS which shall be unused.

b)
c)

The character

SPACE
represented by bit combination 02/00.

A graphic character

A GO set
A set of 94 graphic characters represented by bit combinations 02/01 to 07/14.
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4

The character DELETE A character represented by bit combination
07115.

e)

A Cl set A set of up to 32 control characters represented by bit combinations 08/00 to 09115.

f1

A Cl

set by bit combinations lo/O0 to 15115.

A set of up to 96 graphic characters represented g) A

G2 set

A set of up to 96 graphic characters.
h) A G3 set A set of up to 96 graphic characters. 6.2 Identification of the elements of the g-bit code

The method of identification 6.3 6.3.1 Invocation CO set

of the code elements listed in 6.1 is specified in clause 10.

The identification 6.3.2

of the CO set also invokes that set.

Character SPACE
SPACE shall be represented by bit combination 02/00. It is not explicitly invoked.

The-character 6.3.3 GO set

The GO set shall be as specified 6.3.4 Character

in 7.4. It is not explicitly

invoked.

DELETE
DELETE shall be represented by bit combination 07115. It is not explicitly invoked

The character 6.3.5 Cl set

The identification 6.3.6 G1 set

of the Cl set also invokes that set.

The identification Gl set. G2 set

of the Cl

set also invokes that set. The locking-shift

function

LSlR

shall also invoke

the

6.3.7 Lither

the set as a whole shall be invoked

by the locking-shift

function

LS2R (see annex B) into columns function SS2, (see 7.6).

IO to IS, or individual 6.3.8 Either 7.6). G3 sel

characters of it shall be invoked by means of the single-shift

the set as ;I whole shall be invoked

by the locking-shift

function

LS3R (see annex B) into columns function SS3 (see

10 to 15, or individu;ll

characters of the set shall be invoked by means of the single-shift

7 Specification
of composite

of the characters

of the s-hit

code
RETURN, for the coded representation

The use of control functions such as BACKSPA(T1'
char-;lcters is prohibited

01 CARRIAGE

by this In!e~~n;rtron;ll Standard (see annex C).
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7.1

CO set
for the CO set are 00/14 and 00/15 shall not be used (see annex D); ESCAPE shall be represented by bit combination Ol/ll; subject to the restrictions

The requirements -

bit combinations the control any control specified

character characters

can be allocated

to the other bit combinations

in annex A.
only ESCAPE represented by bit combination 01111 has been registered (Registration

NOTE ISO-IR

- A CO set comprising No. 104). and is identified

by ESC 02101 WO7.

7.2

Character

ESCAPE
character used to form escape sequences. In this International Standard the use of

ESCAPE

is a control

escape sequences is specified

in clause 10.

Table 1 - ESCAPE

ESC

ESCAPE

Ol/ll

7.3

Character

SPACE
having a visual representation consisting of the absence of a graphic symbol. position. It causes

A graphic character

the active position to be advanced by one character

Table 2 - SPACE

1 Acronym

/

Name

)

reprZ~tion

I
7.4
The

SP

I

SPACE

I

02/00

GO set
94 bit combinations 02/01 to 07/M are used to represent to 07/14 graphic characters. All graphic characters

allocated Standard

to bit combinations

in the r::nge 02/Ol

are spacing characters,

that is they cause the by this International

active position

to advance b\, nn~ character

position. The graphic characters allocated

to these 94 bit com':Gnations are specified in table 3.

IS 10401: 1998 ISO/IEC 4873 : 1991

Table 3 - Graphic characters of the GO set

Graphic Symbol ! ; :

Name EXCLAMATION MARK QUOTATION MARK NUMBER SIGN DOLLAR SIGN PERCENT SIGN AMPERSAND APOSTROPHE LEFT PARENTHESIS RIGHT PARENTHESIS ASTERISK PLUS SIGN COMMA HYPHEN-MINUS FULL STOP SOLIDUS DIGIT'ZERO DIGIT ONE DIGIT TWO DIGIT THREE DIGIT FOUR DIGIT FIVE DIGIT SIX DIGIT SEVEN DIGIT EIGHT DIGIT NINE COLON SEMICOLON LESS-THAN SIGN EQUALS SIGN GREATER-THAN SIGN QUESTION MARK COMMERCIAL AT LATIN CAPITAL LETTER LATIN CAPITAL LETTER LATIN CAPITAL LETTER LATIN CAPITAL LETTER LATIN CAPITAL LETTER LATIN CAPITAL LETTER LATIN CAPITAL LETTER LATIN CAPITAL LETTER LATIN CAPITAL LETTER LATIN CAPITAL LETTER LATIN CAPITAL LETTER LATIN CAPITAL LETTER LATIN CAPITAL LETTER LATIN CAPITAL LETTER LATIN CAPITAL LETTER LATIN CAPITAL LETTER

Coded

representation 02/01 02/02 02/03 02/04 02/05 02/06 02/07 02/08 02/09 02/10 02/11 02/12 02/13 02/14 02/15 03/00 03/01 03/02 03/03 03/04 03/05 03/06 03/07 03/08 03/09 03/10 03/11 03/12 03/13 03/14 03/15 04/00 04/01 04/02 04/03 04/04 04/05 04/06 04/07 04/08 04/09 04/10 04/11 04/12 04/13 04/14 04/15 05/00
(continued)

84 ;

) * + , i
0

:
3 4 2 ii 9 : . , < = > ?
A B C cl E : H I J K Ii i P

A B C D E F G H I J K L M N 0 P
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Table 3 - Graphic'characters

of the GO set

Graphic Symbol

Name

Coded
representation

Q i
T U V W X Y z [ ; T ; : e f 9 h i j k 1 m n
0 :

r 5 u v
W X

Y

i I

LATIN CAPITAL LETTER LATIN CAPITAL LETTER LATIN CAPITAL LETTER LATIN CAPITAL LETTER LATIN CAPITAL LETTER LATIN CAPITAL LETTER LATIN CAPITAL LETTER LATIN CAPITAL LETTER LATIN CAPITAL LETTER LATIN CAPITAL LETTER LEFT SQUARE BRACKET REVERSE SOLIDUS RIGHT SQUARE BRACKET CIRCUMFLEX ACCENT LOW LINE GRAVE ACCENT LATIN SMALL LETTER A LATIN SMALL LETTER 6 LATIN SMALL LETTER C LATIN SMALL LETTER D LATIN SMALL LETTER E LATIN SMALL LETTER F LATIN SMALL LETTER G LATIN SMALL LETTER H LATIN SMALL LETTER I LATIN SMALL LETTER J LATIN SMALL LETTER K LATIN SMALL LETTER L LATIN SMALL LETTER M LATIN SMALL LETTER N LATIN SMALL LETTER 0 LATIN SMALL LETTER P LATIN SMALL LETTER Q LATIN SMALL LETTER R LATIN SMALL LETTER S LTIN SMALL LETTER T LATIN SMALL LETTER U LATIN SMALL LETTER V LATIN SMALL LETTER W LATIN SMALL LETTER X LATIN SMALL LETTER Y LATIN SMALL LETTER 2 LEFT CURLY BRACKET VERTICAL LINE RIGHT CURLY BRACKET TILDE

Q R S T U V W X Y Z

-----

05/01 05/02 05/03 05/04 05/05 05/06 05/07 OS/O8 05/09 05/10 05/11 05/12 n5/13 us;/14 05/15 06/00 06/01 06/02 06/03 06/04 06/05 06/06 06/07 06/08 06/09 06/10 06/11 06/12 06/13 06/14 06/15 07/00 07/01 07/02 07/03 07/04 07/05 07/06 07/07 07/08 07/09 07/10 07/11 Q7/12 07/13 ST/l4
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7.5
DEL

Character

DELETE used to erase or obliterate
or

was originally without

an erroneous characters content of

or unwanted stream,

character such

in punched action may

tape. DEL a data the affect

may be used for media-fill stream affecting information

time-fill.

DEL

may be inserted that but

into, or removed

from,

the

information

layout and/or the control of equipment.

Table 4 - DELETE

Acronym

/

Name

1

reprZ,ZZLm

DEL

1 .DELETE

1

07115

7.6

Cl set
set is available for up to 32 control characters in addition 6429. O&O0 to 08/13 and 09100 to 09115 by this to those provided by the CO set. It shall

The Cl

not include any of the control characters of the CO set of IS0 No specific control lnternational When 08/15,
NOTE

characters

are allocated

to bit combinations

Standard. functions SS2 and SS3 are used, they shall be allocated to bit combinations shall not be used.
ISO-IR No.

the single-shift respectively,

08114 and

otherwise

these bit combinations

- A Cl set comprising

only SS2 and SS3 allocated co these bit combinations has been registered (Registration

105). and is identified by ESC OLlO,7 03A17.

7.7

Cl set
set shall be either a 94-character or a 96character set of graphic characters. This set is available

The Cl

for graphic character-s in addition
Either a unique graphic charxter

to those provided by the GO set.
shall be allocated

to each bit combination

or the bit combination

shall be

declared

unused. of the G1 set are represented 10100 to IV15 by bit combinations 10101 to 15/14 if the Cl set. is a 94-character

The characters

set, or by bit combinations 7.8 G2 set

if the Cl

set is a 96-character

The G2 set shall be either for graphic characters Either

a 94.character

or a 96character

set of graphic characters. sets.

This set is available

in addition

to those provided by the GO and the Gl shall be allocated to each bit combination

a unique graphic character unused.

or the bit combination

shall be

declared

If the G2 set is ;I O-I-character 15/15. The characters function LSZK. When invoked

set, then no characters

shall be allocated

to bit combinations

10100 and

of the G2 set shall be invoked either by the single-shift

function

SS2 or by the locking-shift

by SS2, each character

is represented set.

by the bit combination

of SS2 followed

by

one of the bit combinations

in the range 02/01 to 07114 if the G2 set is a 94.character

set, or

02100 to 07115 if the G2 set is a 96-character

9
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When

invoked

by LSZR, the characters of the G2 set are represented set, or by bit combinations set.

by bit combinations

lo/O1

to 15/14 if the G2 set is a 94-character a %-character 7.9 The C3 set G3 set shall be a 94-character in addition

IO/o0 to IS/15 if the G2 set is

or a 96-character

set of graphic

characters.

This set is available

for

graphic characters Either

to those provided by the GO, the Cl

and the G2 sets. or the bit combination shall be

a unique graphic character unused.

shall be allocated to each bit combination

declared

If the G3 set is a 94-character The characters function LS3R. When invoked

set, then no character shall be allocated

to bit combinations
function

lO/OO and 1315.

of the G3 set shall be invoked either by the single-shift

SS3 or by the locking-shift

by SS3. each character

is represented set.

by the bit combination

of SS3 followed

by

one of the bit combinations

in the range 02/01 to 07/14 if the G3 set is a 94-character

set, or

02100 to 07/15 if the G3 set is a 96-character When invoked

by LS3R, the characters of the G3 set are represented set, or by bit combinations set. of the &bit code of the elements of the 8-bit code.

by bit combinations

lO/Ol

to 15/14 if the G3 set is a 94-character a 96-character 7.10 Summary Figure

lO/OO to 1505

if the G3 set is

of the specification

1 summarizes

the specification

8 Levels
This International 8.1 Standard specifies three nested levels of implementation.

Level

1
facilities:

Level 1, see figure 2, comprises the following At Level a CO set; the character the GO set; the character a Cl set; aGlset. 1 no shift functions DELETE represented SPACE represented

by bit combination

02/00;

by bit combination

07115;

shall be used and the GO and Cl

sets are assumed to be invoked permanently

in columns At Level addition GO set.

02 to 07 and 10 to 15, respectively. set may be empty if there is no requirement for control characters in

1 the Cl set and/or the Gl to those provided

by the CO set and/or for graphic characters in addition

to those provided

by the

At Level 1 a version shall not include a G2 or a G3 set. 8.2 Level 2

Level 2, see figure 3, comprises the facilities of Level 1 and in addition to them a G2 set the characters of which shall be invoked individually a G3 set the characters of which shall be invoked individually 10 by SS2; by SS3.

1s 10401: 1998 ISO/IEC 4873 : 1991

At Level 2 no other shift functions shall be used. The Gl set shall not be empty; either the G2 or the C3 set may be empty but not both. it shall contain at least the single-shift functions SS2 and SS3.

The Cl set shall not be empty, 8.3 Level 3 4, comprises

Level 3, see figure functions: LSlR LSZR LS3R

all the facilities

of Level 2, with the addition

of the following

three shift

-

The Gl The

set shall not be empty; either the G2 or the G3 set may be, but not both. explicitly by LSlR, LSZR and LS3R, respectively. Individual

G 1, G2 and G3 sets can be invoked

characters of the G2 and G3 sets can be invoked by SS2 and SS3, respectively. The Cl set shall not be empty, it shall contain at least the single-shift functions SS2 and SS3.

9 Version of the S-bit code
9.1 Contents of a version
A version of the 8-bit code is a coded character set, a Cl and a Gl set and, optionally, imple`mentation (clause 8) shall also be identified. be the subject of a specification document which states how the above options Standard. set in accordance with clauses 6, 7 and 8, in which a CO identified. The level of a G2 and a G3 set have been uniquely

Such a version will generally

have been exercised. Such a specification 9.2 Unique coding of characters the same character

is said to be in accordance with this International

In a version character

may occur in more than one of the GO, Gl, as a character International in another or IS0

G2 and G3 sets. Such a define

shall be regarded

as the same character the specifications,

of those sets if both characters

have the same name within the two sets. If the same character set itself or within which the character

Register entries, that respectively

has been allocated to more than one of the GO, Gl, repertoire from taken the lowest numbered

G2 and G3 sets, either within G2, G3)

the in

the character

associated with that set, then that character

shall be represented

by the coded representation

set (in the sequence GO, Gl,

has been allocated. for such a character numbered invoked. within one of the other, invoked higher numbered numbered sets shall not be set in which the

A coded representation
~rsecl,

cvrn

if the

higher

set is already

and the lowest

char,rcter

is allocated

is not currently

10 Identification

of version and level

10.1 Purpose and context of identification
CC-data-elements a composite identification unit conforming of coded to a version of this International information that is interchanged Standard Standard are intended between a sender to form all or part of and a recipient. The shall by the originator

of the version of this International to the recipient. Standard.

that has been adopted

also be available

The route by which such identification

is communicated

to the recipient

is outside the scope of this International

II

IS 10401: 1998 ISO/IEC 4873 : 1991

However, information.

some standards

for interchange

of coded information

may permit,

or require,

that the coded

representation

of the identification Standard.

applicable

to the CC-data-elements form

forms a part of the interchanged of a version and a level of data element,

This clause specifies a coded representation Such coded representations interchange in information

for the identification

this International

all or part of an identifying

which may be included 10.2 Identification

in accordance with the relevant standard.

of level Standard shall be identified by means of an announcer sequence from the

A level of this International following list.

ESC 02/00 04112 shall identify ESC 02/00 04113 shall identify ESC 02/00 04/14 shall identify 10.3 Identification

Level 1. Level 2. Level 3.

of a version
of a version of this International Standard shall comprise a set of identifications, one for

The identification designating ESC 02/01

each of the C sets and G sets that constitute

the version. Each identification

in the set shall consist of a

escape sequence of the type shown below. F shall identify the CO set. the Cl set. the GO set (Registration the Gl set. the G2 set. the G3 set. from tne International Register (IS0 2375). If any of ISO-IR No. 6).

ESC 02/02 F shall identify

ESC 02108 04102 shall identify

ESC 02/09 F or ESC 02/13 F shall identify ESC 02/10 F or ESC 02/14 F shall identify ESC 02/11 F or ESC 02115 F shall identify

The final byte F of these sequences shall be obtained the C sets or G sets is empty, is 07/14. For a version of this International sets shall be included.
NOTE - The designating escape sequence Whilst

the identification

shall be the same escape sequence in which the final byte F

Standard that conforms

to Level 1, no identifications

for the G2 and G3

for the GtI set is indicated

with the tinal byte which

corresponds of ISOAEC

to the GO set 4873, it has

of this International

Standard.

this designation

is not necessary according of this International

IO this edition Standard

been kept for reasons of compatibility

with the 2nd edition

10.4 Switching
In information

from one version to another
interchange. the use of a different version requires that this version be identified. Thus, if

any of the C sets or of the G sets is changed, an announcer sequences according 10.5 to IO,3 shall be required.

sequence according to 10.2 and a set of escape

Switching

from one level to another
interchange, any change of level, whether or not the C sets and G sets are changed, requires to 10.2 and a set of escape sequences according to 10.3.

In information an announcer

sequence according

12
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See 7.1

See 7.6
Figure 1 - Elements of the $-bit code
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Figure 2 - Level 1
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Annex A
(normative)

Restrictions applicable to the CO and Cl sets

The definition processed other formatted treatment Whilst

of some control than serially

functions

in this International direction. direction

Standard

assumes that data associated with functions are included additional in data special

them is to be processed serially in a forward in a forward for fixed-record

When they are included in strings of data which are or when these control effects or may require

processing they may have undesirable in their desired function. specifies requirements

to ensure that they'result Standard

this International

for bit combinations

00114, 0005

and 01/I 1 of the

CO set, it places the following a) If control

restrictions characters

on the use of the remaining

29 bit combinations: 6429 are used, they shall have

described

in the CO set of ISOlIEC

the coded representations b) c)

and the definitions

specified therein.

None of these control characters may be allocated to the Cl set. Transmission information control Standards, control characters are intended networks. to control or facilitate transmission of

over telecommunication for example IS0 1745.

Procedures for the use of the transmission are the subject of other International

characters

on telecommunication

networks

16

IS 10401: 1998 I!So/IEc 4873 : 1991

Annex B
(informative)

Shift functions

The use of the shift functions

mentioned

in this International

Standard is specified

in Is0 2022. They are
Sets to be Used

defined in lSO/IEC 6429 and registered in the International with Escape Sequences. They are reproduced here for convenience only.

Register of Coded Character

LSIR : ESC 07/14 LSZR : Esc 07113 LS3R : ESC 07112

17
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Annex C
(informative)

Composite graphic characters

Generally, graphic character

the specified repertoire

graphic

character

set constituting

the repertoire

of a code is identical 6937

with

the

characters

allocated

to the bit combinations

of that code. In the case of IS0

the graphic

is larger than the coded graphic character set. character imaging of a graphic character the graphic repertoire is represented by a graphic symbol at by one character

In any case, each graphic a character position. By means of the control 6429, same character position. position. After

symbol the active position

is advanced

function This

GRAPHIC

CHARACTER does not

COMBINATION a new

(CCC), character graphic

specified but

in IS0 only a

it is possible to represent

the graphic symbols of more than one graphic presentation constitute

at one and the

symbol

juxtaposition character of characters

of the graphic symbols of the characters considered. This does not create a new coded graphic to the specified repertoire. manner. (BS) or CARRIAGE RETURN (CR), as permitted by other to set consists It only images the graphic symbols of that repertoire functions in a particular BACKSPACE by ISO/lEC

nor does it add a graphic character

The use of the control International Standards,

for example

646, allows the creation of coded characters in addition such new coded character a character of the coded character

those of the coded character of a combination (or of the repertoire

set. Whilst the graphic symbol representing this combined

of two or more graphic symbols, each representing where it is larger), of its components. repertoire

graphic symbot must be regarded as a new graphic

symbol in its own right and not as a juxtaposition Using the latter method not be met. This International Standard characters therefore prohibits

makes the graphic character of the data containing

undefined

and requires agreement

between can often

the sender and the recipient

such additional

characters. This requirement

the use of BACKSPACE as listed exhaustively

or CARRIAGE either

RETURN

to

create additional repertoire,

and requires each graphic character

to be represented in lSO/IEC

by a single S-bit

byte or by the defined

series of bit combinations

6937 where the allowed

which is larger than the coded character set, is fully specified.

Examples
It is not allowed to generate the character function "not equal" by combining the characters EQUALS and

SOLIDUS

by means of the control

BACKSPACE.

=BS/
It is allowed LETTER CCC. to generate the presentation SMALL LETTER T and LATIN

-

#
CAPITAL LETTER P, LATIN SMALL function position by means of the control

of the letters LATIN S at one character

GCCPtS
In the first case a new character case three characters

-

pts
In the second

with a new graphic symbol has been added to the repertoire. have been imaged in a particular way.

of the specified repertoire
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Annex D
(informative)

Use of bit combinations

0004 and 00115

In ISO

2022, the shift functions
in an 8-bit code.

LSO and LSl

are represented

by the bit combinations

00/14

and 00/15,

respectively,

These two bit combinations Bit combinations between Standard. a 7-bit

are not used in this International

Standard. when transcoding

00/14 and OO/lS are available to represent SO and Sl, or LSO and LSI code and an 8-bit code. However, this use is outside

the scope of this International

Annex E
(informative)

Main differences between the second edition (1986) and the puent International Standard

(third) edition of this

I*,,1 The positions.

GO set is fuilv

defined,

i.e. specific graphic

cI~r3...~ "

,. ken

allocated

to all 94 chsrxter

E.2

A hierarchy

of the G sets is specified which allows the a:. ;bii:~ -1 of a unique coded representation

to

a character

when that characicl- is present in more than one G A;.

E-3

The description

:!I the identification

of the C and G sets is separated

from

the specification

of the

coding structure.
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